A System for the Semantic Interpretation of Unrestricted Domains using WordNet
نویسندگان
چکیده
In this demonstration, we show an algorithm for the semantic interpretation of unrestricted texts. The algorithm presents a solution for the following interpretation problems: determination of the meaning of the verb, identification of thematic roles and adjuncts, and attachments of prepositional phrases (PPs). An interesting aspect of the algorithm is that the solution of all these problems is interdependent. The interpretation algorithm uses WordNet (Miller et al. 1993) as its lexical knowledge-base. Predicates, or verbal concepts, have been defined for WordNet verb classes (Fellbaum 1993), which have been reorganized considerably following the criteria imposed by the interpretation algorithm. The WordNet ontology for nouns has also undergone some reorganization and redefinition to conform with the entries in the thematic roles of the predicates. We have taken a top-down approach that defines generic abstract predicates subsuming semantically and syntactically a large class of verbs. WordNet verb classes have been mapped into these generic predicates. Some of this mapping has required us to define new classes and to reclassify and/or redefine some WordNet classes and subclasses (Gomez 1998a). The predicates form a hierarchy in which thematic roles and inferences are inherited by subpredicates from their superpredicates. Two major consequences derive from anchoring verb classes in abstract semantic predicates: coalescing several WordNet senses into a predicate, which reduces the systemic polysemy in some WordNet senses, and mapping the same WordNet synset into distinct predicates. For instance, all the 5 synsets listed by WordNet for "travel": "travell, go, move, locomote;" "travel2, journey; .... travel3, take a trip, make a trip;" "travel4, journey;" and "travel5 (undergo transportation, as in vehicle)" are coalesced into the abstract semantic predicate change-of-location-by-animate. This predicate defines a class of verbs containing the most generic properties shared by all members of the class. The differentia between this predicate and its subpredicates are given by one or more of the following: a) specific selectional restrictions for the thematic roles, b) different syntactic realizations of the thematic roles, and c) specific sets of inferences associated with the subpredicates. For instance, the instrument of drive is always a vehicle, while the instrument of change-of-location-by-animate can be an animate, an animate body part, etc. The instrument of drive is never realized by a subject, but the instrument of the generic predicate can be realized by a subject, e.g., "This bus goes to Cambridge every Wednesday." On the other hand, migrate differs from change-of-location-by-animate only by the …
منابع مشابه
Automatic Construction of Persian ICT WordNet using Princeton WordNet
WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...
متن کاملQuery Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...
متن کاملDevelopment of a Combined System Based on Data Mining and Semantic Web for the Diagnosis of Autism
Introduction: Autism is a nervous system disorder, and since there is no direct diagnosis for it, data mining can help diagnose the disease. Ontology as a backbone of the semantic web, a knowledge database with shareability and reusability, can be a confirmation of the correctness of disease diagnosis systems. This study aimed to provide a system for diagnosing autistic children with a combinat...
متن کاملWikipedia-based Semantic Interpretation for Natural Language Processing
Adequate representation of natural language semantics requires access to vast amounts of common sense and domain-specific world knowledge. Prior work in the field was based on purely statistical techniques that did not make use of background knowledge, on limited lexicographic knowledge bases such as WordNet, or on huge manual efforts such as the CYC project. Here we propose a novel method, cal...
متن کاملDevelopment of a Combined System Based on Data Mining and Semantic Web for the Diagnosis of Autism
Introduction: Autism is a nervous system disorder, and since there is no direct diagnosis for it, data mining can help diagnose the disease. Ontology as a backbone of the semantic web, a knowledge database with shareability and reusability, can be a confirmation of the correctness of disease diagnosis systems. This study aimed to provide a system for diagnosing autistic children with a combinat...
متن کامل